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Abstract 

Robinsonian matrices arise in the classical seriation problem and play an important role 
in many applications where unsorted similarity (or dissimilarity) information must be re¬ 
ordered. We present a new polynomial time algorithm to recognize Robinsonian matrices 
based on a new characterization of Robinsonian matrices in terms of straight enumerations 
of unit interval graphs. The algorithm is simple and is based essentially on lexicographic 
breadth-first search (Lex-BFS), using a divide-and-conquer strategy. When applied to a non¬ 
negative symmetric n x n matrix with m nonzero entries and given as a weighted adjacency 
list, it runs in 0{d{n + m)) time, where d is the depth of the recursion tree, which is at most 
the number of distinct nonzero entries of A. 

Keywords: Robinson (dis)similarity; unit interval graph; Lex-BFS; seriation; partition 
refinement; straight enumeration 


1 Introduction 

An important question in many classification problems is to find an order of a collection of 
objects respecting some given information about their pairwise (dis)similarities. The classic 
seriation problem, introduced by Robinson [34| for chronological dating, asks to order objects 
in such a way that similar objects are ordered close to each other, and it has many applications 
in different fields (see and references therein). 

A symmetric matrix A = is a Robinson similarity matrix if its entries decrease 

monotonically in the rows and columns when moving away from the main diagonal, i.e., if 
^ik < niin{j4jj, Ajfc} for all 1 < i < j < k < n. Given a set of n objects to order and a 
symmetric matrix A = (Aij) which represents their pairwise correlations, the seriation problem 
asks to find (if it exists) a permutation tt of [n] so that the permuted matrix A.,^ = (A.n.(q^(j)) is 
a Robinson matrix. If such a permutation exists then A is said to be a Robinsonian similarity, 
otherwise we say that data is affected by noise. The dehnitions extend to dissimilarity matrices: 
A is a Robinson(ian) dissimilarity preciely when —A is a Robinson(ian) similarity. Hence results 
can be directly transferred from one class to the other one. 

Robinsonian matrices play an important role in several hard combinatorial optimization 
problems and recognition algorithms are important in designing heuristic and approximation 

Correspondence to : M.Seminaroti@cwi.nl (M. Seminaroti), M.Laurent@cwi.nl (M. Laurent), CWI, Postbus 
94079, 1090 GB, Amsterdam. Tel.:+31 (0)20 592 4386. 


1 



algorithms when the Robinsonian property is desired but the data is affected by noise (see 
e.g. [6,15,24). In the last decades, different characterizations of Robinsonian matrices have 
appeared in the literature, leading to different polynomial time recognition algorithms. Most 
characterizations are in terms of interval (hyper)graphs. 

A graph G = {V,E) is an interval graph if its nodes can be labeled by intervals of the real 
line so that adjacent nodes correspond to intersecting intervals. Interval graphs arise frequently 
in applications and have been studied extensively in relation to hard optimization problems (see 
e.g. [^[^[^). A binary matrix has the consecutive ones property (CIP) if its columns can be 
reordered in such a way that the ones are consecutive in each row. Then a graph G is an interval 
graph if and only if its vertex-clique incidence matrix has CIP, where the rows are indexed by 
the vertices and the columns by the maximal cliques of G [^ . 

A hypergraph H = (V,£) is a generalization of the notion of graph where elements of £, 
called hyperedges, are subsets of V. The incidence matrix of H is the O/I matrix whose rows 
and columns are labeled, respectively, by the hyperedges and the vertices and with an entry I 
when the corresponding hyperedge contains the corresponding vertex. Then H is an interval 
hypergraph if its incidence matrix has CIP, i.e., if its vertices can be ordered in such a way that 
hyperedges are intervals. 

Given a dissimilarity matrix A G 5” and a scalar a, the threshold graph Ga = {V,Ea) has 
edge set Ea = {{x, y} : A^y < a} and, for x £ V, the ball B(x, a) := {y £ V : A^y < a} consists 
of X and its neighbors in Ga- Let B denote the collection of all the balls of A and Hq denote the 
corresponding ball hypergraph, with vertex set V = [n] and with B as set of hyperedges. One 
can also build the intersection graph Gjs of B, where the balls are the vertices and connecting 
two vertices if the corresponding balls intersect. Most of the existing algorithms are then based 
on the fact that a matrix A is Robinsonian if and only if the ball hypergraph Lfg is an interval 
hypergraph or, equivalently, if the intersection graph Gb is an interval graph (see 

Testing whether an m x n binary matrix with / ones has CIP can be done in linear time 
0(n-|-m-|-/) (see the first algorithm of Booth and Leuker [^ based on PQ-trees, the survey [^ 
and further references therein). Mirkin and Rodin [28] gave the first polynomial algorithm to 
recognize Robinsonian matrices, with O(n^) running time, based on checking whether the ball 
hypergraph is an interval hypergraph and using the PQ-tree algorithm of to check whether 
the incidence matrix has CIP. Later, Chepoi and Fichet [^ introduced a simpler algorithm 
that, using a divide-an-conquer strategy and sorting the entries of A, improved the running 
time to 0{n^). The same sorting preprocessing was used by Seston 37 , who improved the 
algorithm to 0{n^ log n) by constructing paths in the threshold graphs of A. Very recently, Prea 
and Fortin |30| presented a more sophisticated 0{n^) algorithm, which uses the fact that the 
maximal cliques of the graph Gg are in one-to-one correspondence with the row/column indices 
of A. Roughly speaking, they use the algorithm from Booth and Leuker [^ to compute a first 
PQ-tree which they update throughout the algorithm. 

A numerical spectral algorithm was introduced earlier by Atkins et al. for checking whether 
a similarity matrix A is Robinsonian, based on reordering the entries of the Fiedler eigenvector 
of the Laplacian matrix associated to A, and it runs in 0{n{T{n) + nlogn)) time, where T(n) 
is the complexity of computing (approximately) the eigenvalues of an n x n symmetric matrix. 

In this paper we introduce a new combinatorial algorithm to recognize Robinsonian matrices, 
based on characterizing them in terms of straight enumerations of unit interval graphs. Unit 
interval graphs are a subclass of interval graphs, where the intervals labeling the vertices are 
required to have unit length. As is well known, they can be recognized in linear time 0(|U| -|-|£'|) 
(see e.g. [8,22 and references therein). Many of the existing algorithms are based on the equiv- 
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alence between unit interval graphs and proper interval graphs (where the intervals should be 
pairwise incomparable) (see 


31,^). Unit interval graphs have been recently characterized in 


terms of straight enumerations, which are special orderings of the classes of the ‘undistinguisha- 
bility’ equivalence relation, calling two vertices undistinguishable if they have the same closed 
neighborhoods (see [^). This leads to alternative unit interval graph recognition algorithms 
(see [^[^), which we will use as main building block in our algorithm. Our algorithm relies 
indeed on the fact that a similarity matrix A is Robinsonian if and only if its level graphs (the 
analogues for similarities of the threshold graphs for dissimilarities) admit pairwise compatible 
straight enumerations (see Theorem]^. 

Our approach differs from the existing ones in the sense that it is not directly related to 
interval (hyper)graphs, but it relies only on unit interval graphs (which are a simpler graph 
class than interval graphs) and on their straight enumerations. Furthermore, our algorithm 
does not rely on any sophisticated external algorithm such as the Booth and Leuker algorithm 
for CIP and no preprocessing to order the data is needed. In fact, the most difficult task carried 
out by our algorithm is a Lexicographic Breadth-First Search (abbreviated Lex-BFS), which is 
a variant of the classic Breadth-First Search (BFS), where the ties in the search are broken by 
giving preference to those vertices whose neighbors have been visited earliest (see [35] and |19j). 
Following [^, we in fact use the variant Lex-BFS-I- introduced by 38 to compute straight 


enumerations. Our algorithm uses a divide-and-conquer strategy with a merging step, tailored 
to efficiently exploit the possible sparsity structure of the given similarity matrix A. Assuming 
the matrix A is given as an adjacency list of an undirected weighted graph, our algorithm runs 
in 0{d{m -I- n)) time, where n is the size of A, m is the number of nonzero entries of A and d 
is the depth of the recursion tree computed by the algorithm, which is upper bounded by the 
number L of distinct nonzero entries of A (see Theorem 14). Furthermore, we can return all the 
permutations reordering A as a Robinson matrix using a PQ-tree data structure on which we 
perform only a few simple operations (see Section 4.3). 

Our algorithm uncovers an interesting link between straight enumerations of unit interval 
graphs and Robinsonian matrices which, to the best of our knowledge, has not been made before. 
Moreover it provides an answer to an open question posed by M. Habib at the PRIMA Conference 
in Shanghai in June 2013, who asked whether it is possible to use Lex-BFS-I- to recognize 
Robinsonian matrices jllj . Alternatively one could check whether the incidence matrix M of 
the ball hypergraph of A has CIP, using the Lex-BFS based algorithm of [19] , in time 0{r+c+f) 
time if M is r X c with / ones. As r < nL, c = n and / < Lm, the overall time complexity 
is 0{L{n + m)). Interestingly, this approach is not mentioned by Habib. In comparison, an 
advantage of our approach is that it exploits the sparsity structure of the matrix A, as d can be 
smaller than L. 

This paper is an extended version of the work |23j , which appeared in the proceedings of the 
9th International Conference on Algorithms and Complexity (CIAC 2015) . 


Contents of the paper 

Section [^contains preliminaries about weak linear orders, straight enumerations and unit inter¬ 
val graphs. In Section]^ we characterize Robinsonian matrices in terms of straight enumerations 
of unit interval graphs. In Section we introduce our recursive algorithm to recognize Robin¬ 
sonian matrices, and then we discuss the complexity issues and explain how to return all the 
permutations reordering a given similarity matrix as a Robinson matrix. The final Section 
contains some questions for possible future work. 
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2 Preliminaries 


Throughout 5” denotes the set of symmetric n x n matrices. Given a permutation vr of [n] and 
a matrix A G S'"', At^ := (^ 7 r(j) 7 rQ'))fj=i £ 5"' is the matrix obtained by permuting both the 
rows and columns of A simultaneously according to vr. For U C [n], A[U] = {Aij)ij^u is the 
principal submatrix of A indexed by U. As we deal exclusively with Robinson(ian) similarities, 
when speaking of a Robinson(ian) matrix, we mean a Robinson(ian) similarity matrix. 

An ordered partition {Bi ,..., Bp) of a finite set V corresponds to a weak linear order V’ on 

V (and vice versa), by setting x y \i x,y belong to the same class Bi, and x y if x G Bi 

and y G Bj with i < j. Then we also use the notation ip = {Bi ,..., Bp) and Bi Bp. 

When all classes Bi are singletons then ip is a linear order (i.e., total order) of V. 

The reversal of ip is the weak linear order, denoted ip, of the reversed ordered partition 
{Bp,..., Bi). For U C V, ip[U] denotes the restriction of the weak linear order ip to U. Given 
disjoint subsets U,W C R, we say U <,/, W if x y for all x G U, y G W. If ipi and ip 2 are 
weak linear orders on disjoint sets Vi and V 2 , then ip = {ipi,ip 2 ) denotes their concatenation 
which is a weak linear order on Vi U V 2 . 

The following notions of compatibility and refinement will play an important role in our 
treatment. Two weak linear orders ipi and ip 2 on the same set V are said to be compatible if 
there do not exist elements x,y G V such that x y and y <i /,2 x. Hence, ipi and ip 2 are 
compatible if and only if there exists a linear order vr of R which is compatible with both ipi 

and ip 2 Then their common refinement is the weak linear order T = ipi A ip 2 on V defined by 

X y if X =. 0 ^ y for all i G {1, 2}, and x y if x y for all i G {1, 2} with at least one 
strict inequality. 

We will use the following fact, whose easy proof is omitted. 

Lemma 1. Let ipi,... ,ipL be weak linear orders on V. Hence, ipi,... ,ipL are pairwise compatible 
if and only if there exists a linear order n of V which is compatible with each of ipi,... ,ipL, in 
which case n is compatible with their common refinement ipi A ■ ■ ■ A ipL- 

In what follows R = [n] = {1,..., n} is the vertex set of a graph G = (R, E), whose edges are 
pairs {x, y} of distinct vertices x,y G V. For x G V, we denote by N{x) = {y G V : {x, y} G E} 
the neighborhood of x. Then, its closed neighborhood is the set iV[x] = {x}U A^(x). Two vertices 
x,y gV are undistinguishable if A^[x] = N[y]. This defines an equivalence relation on R, whose 
classes are called the blocks of G. Glearly, each block is a clique of G. Two distinct blocks B 
and R' are said to be adjacent if there exist two vertices x G B, y G B' that are adjacent in G 
or, equivalently, if R U R' is a clique of G. A straight enumeration of G is then a linear order 
(p = (Ri,..., Bp) of the blocks of G such that, for any block Bi, the block Rj and the blocks Bj 
adjacent to it are consecutive in the linear order (see [^). The blocks Ri and Bp are called the 
end blocks of (p and Bi (with 1 < i < p) are its inner blocks. Having a straight enumeration is a 
strong property, and not all graphs have one. In fact, this notion arises naturally in the context 
of unit interval graphs as recalled below. 

A graph G = {V = [n],E) is called an interval graph if its vertices can be mapped to 
intervals Ii,..., In of the real line such that, for distinct vertices x,y G V, {x,y} G E If and 
only if Ix r\ ly 0. Such a set of intervals is called a realization of G, and it is not unique. If 
the graph G admits a realization by unit intervals, then G is said to be a unit interval graph. 

Interval graphs and unit interval graphs play an important role in many applications in 
different fields. Many NP-complete graph problems can be solved in polynomial time on interval 
graphs (this holds e.g. for the bandwidth problem [^). However, there are still problems which 
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remains NP-hard also for interval graphs (this holds e.g. for the minimal linear arrangement 
problem (zl)- It is well known that interval graphs and unit interval graphs can be recognized 
in 0{\V\ + \E\) time (|[§{T§[T2|[T^[T§[^[2§[^. For a more complete overview on linear 
recognition algorithms for unit interval graphs, see and references therein. Most of the 
above mentioned algorithms are based on the equivalence between unit interval graphs and 
proper interval graphs (i.e., graphs admitting a realization by pairwise incomparable intervals) 
or indifference graphs . Furthermore, there exist several equivalent characterizations for unit 
interval graphs. The following one in terms of straight enumerations will play a central role 
in our paper. 

Theorem 2 (Unit interval graphs and straight enumerations), A graph G is a unit 
interval graph if and only if it has a straight enumeration. Moreover, if G is connected, then it 
has a unique (up to reversal) straight enumeration. 

On the other hand, if G is not connected, then any possible linear ordering of the connected 
components combined with any possible orientation of the straight enumeration of each con¬ 
nected component induces a straight enumeration of G. The next theorem summarizes several 
known characterizations for unit interval graphs, combining results from [9 
Recall that Ri ,3 is the graph with one degree-3 vertex connected to three degree-1 vertices (also 
known as claw). 

Theorem 3. The following are equivalent for a graph G = {V,E). 

(i) G is a unit interval graph. 

(a) G is an interval graph with no induced subgraph Ki^^. 

(Hi) (3-vertex condition) There is a linear ordering n of V such that, for all x,y,z G V, 

a; < 7 r y < 7 r {a:, z} £ E ^ {x, y}, {y, zj £ E. (1) 

(iv) (Neighborhood condition) There is a linear ordering n of V such that for any x £V 
the vertices in N[x] are consecutive with respect to vr. 

(v) (Clique condition) There is a linear ordering tt of V such that the vertices contained in 
any maximal clique of G are consecutive with respect to tt. 


17,26 29,31 


3 Robinsonian matrices and unit interval graphs 

In this section we characterize Robinsonian matrices in terms of straight enumerations of unit 
interval graphs. We focus first on binary Robinsonian matrices. We may view any symmetric 
binary matrix with all diagonal entries equal to 1 as the extended adjacency matrix of a graph. 
The equivalence between binary Robinsonian matrices and indifference graphs (and thus with 
unit interval graphs) was first shown by Roberts |31| . Furthermore, as observed, e.g., by Corneil 
et al. [^, the “neighborhood condition” for a graph is equivalent to its extended adjacency matrix 
having CIP. Hence we have the following equivalence between Robinsonian binary matrices and 
unit interval graphs, which also follows as a direct application of Theorem 

Lemma 4. Let G = {V, E) be a graph and Aq be its extended adjacency matrix. Then, Aq is 
a Robinsonian similarity if and only if G is a unit interval graph. 
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The next result characterizes the linear orders that reorder the extended adjacency matrix 
Ac as a Robinson matrix in terms of the straight enumerations of G. It is simple but will play 
a central role in our algorithm for recognizing Robinsonian similarities. 

Theorem 5. Let G = (R, E) he a graph. A linear order tt of V reorders Ac as a Robinson 
matrix if and only if there exists a straight enumeration of G whose corresponding weak linear 
order f) is compatible with vr, i.e., satisfies: 


Vx, y G V with x y 


X <ny 


X y. 


( 2 ) 


Proof. Assume that vr is a linear order of V that reorders Aq as a Robinson matrix. Then it is 
easy to see that the 3-vertex condition holds for vr and that each block of G is an interval w.r.t. 
vr. Therefore the order vr induces an order tp of the blocks: Bi ... <.0 Bp, with Bi <.0 Bj 
if and only if x <,r U for all x G Bi and y G Bj. In other words, is compatible with vr by 
construction. Moreover, tp defines a straight enumeration of G. Indeed, if Bi Bj Bk and 
Bi,Bk are adjacent then Bj is adjacent to Bi and since this property follows directly from 
the 3-vertex condition for vr. 

Conversely, assume that Bi Bp is a straight enumeration of G and let vr be a 

linear order of V which is compatible with pj, i.e., satisfies Q. We show that vr reorders Aq as 
a Robinson matrix. That is, we show that if x <jr y <tt z, then {Ag)xz < xa\n{{AG)xy, {AG)yz} 
or, equivalently, that {x,z} G E implies {x,y},{y,z} G E. If x,z belong to the same block Bi 
then y G Bi (using ([^) and thus {x, y}, {y, z} G E since Bi is a clique. Assume now that x G Bi, 
z G Bk and {x, z} G E. Then, Bi Bk and Bi, Bk are adjacent blocks and thus Bi U is a 
clique. If y G R* then y is adjacent to x and z (since Bi U R^ is a clique). Analogously if y G Rfc. 
Suppose now that y G Bj. Then, using Q, we have that R* <.^ Bj Bk. As V' is a straight 
enumeration with Bi,Bk adjacent it follows that Bj is adjacent to Bi and to Bk and thus y is 
adjacent to x and z. □ 


Hence, in order to find the permutations reordering a given binary matrix A as a Robinson 
matrix, it suffices to find all the possible straight enumerations of the corresponding graph G. 
As is shown e.g. in [9 13 , this is a simple task and can be done in linear time. This is coherent 
with the fact that CIP can be checked in linear time (see [1^ and references therein). 


We now consider a general (nonbinary) matrix A. We first introduce its ‘level graphs’, the 
analogues for similarity matrices of the threshold graphs for dissimilarities. Let ao < ai < 
■ ■ ■ < ol denote the distinct values taken by the entries of A. The graph = {V, Ei), whose 
edges are the pairs {x,y} with Axy > ai, is called the i-th level graph of A. Let J be the all 
ones matrix. Clearly, both J and — J are Robinson matrices. Hence, we may and will assume, 
without loss of generality, that ao = 0. Then, A is nonnegative and G^^^ is its support graph. 
The level graphs can be used to decompose A as a conic combination of binary matrices and, 
as already observed by Roberts [^, A is Robinson precisely when these binary matrices are 
Robinson. This is summarized in the next lemma, whose easy proof is omitted. 


Lemma 6. Let A G R"’ with distinct values oq < ai < ■ ■ ■ < ol and with level graphs 
G(i),...,G(^). Then: 

L 

A = aoJ + ^ («£ — a^-i) Ag.(o. 

i=i 

Moreover, A is Robinson if and only if Aq^i) is Robinson for each i G [L]. 
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Clearly, if ^4 is a Robinsonian matrix then the adjacency matrices of its level graphs {£ G [L]) 

are Robinsonian too. However, the converse is not true: it is easy to build a small example 
where A is not Robinsonian although the extended adjacency matrix of each of its level graphs 
is Robinsonian. The difficulty lies in the fact that one needs to find a permutation that reorders 
simultaneously the extended adjacency matrices of all the level graphs as Robinson matrices. 
Roberts first introduced a characterization of Robinsonian matrices in terms of indifference 
graphs (i.e. unit interval graphs). Rephrasing his result using the notion of level graphs, he 
showed that A is Robinsonian if and only if its level graphs have vertex linear orders that are 
compatible (see 33, Theorem 4.4]). However, he does not give any algorithmic insight on how 


to find such orders. 

Combining the links between binary Robinsonian matrices and unit interval graphs (Lemma|^ 
and between reorderings of binary Robinsonian matrices and straight enumerations of unit in¬ 
terval graphs (Theorem together with the decomposition result of Lemma we obtain the 
following characterization of Robinsonian matrices. 

Theorem 7. Let A £ S'^ with level graphs ..., G^^\ Then: 

(i) A is a Robinsonian matrix if and only if there exist straight enumerations of G^^\ ..., 
G^^') whose corresponding weak linear orders ifi,... are pairwise compatible. 


(ii) A linear order tt of V reorders A as a Robinson matrix if and only if there exist pairwise 
compatible straight enumerations of G^^\ ..., G^^\ whose corresponding common refine¬ 
ment is compatible with tt. 


Proof. Observe first that if assertion (ii) holds then (i) follows directly using the result of 
LemmaWe now prove (ii). Assume that A is Robinsonian and let vr a linear order of V that 
reorders A as a Robinson matrix. Then Aj^ is Robinson and thus, by lemma each permuted 
matrix (AQ(e))j^ is a Robinson matrix. Then, applying Theorem]^ for each £ £ [L], there 
exists a straight enumeration of G^^^ whose corresponding weak linear ordering ifi£ is compatible 
with vr. We can thus conclude that the common refinement of if i,..., fiz, is compatible in view 
of Lemma Conversely, assume that there exist straight enumerations of ..., G^^^ whose 
corresponding weak linear orders ifi,... ,ifL ai'e pairwise compatible with tt and their common 
refinement is compatible with tt. Then, by Theorem]^ vr reorders simultaneously each Ag,(£) as 
a Robinson matrix and thus is Robinson, which shows that A is Robinsonian. □ 


4 The algorithm 

We describe here our algorithm for recognizing whether a given symmetric nonnegative matrix A 
is Robinsonian. First, we introduce an algorithm which either returns a permutation reordering 
A as a Robinson matrix or states that A is not a Robinsonian matrix. Then, we show how to 
modify it in order to return all the permutations reordering A as a Robinson matrix. 

4.1 Overview of the algorithm 

The algorithm is based on Theorem The main idea is to find straight enumerations of the 
level graphs of A that are pairwise compatible and to compute their common refinement. The 
matrix A is not Robinsonian precisely when these objects cannot be found. As above, L denotes 
the number of distinct nonzero entries of A and throughout G^^^ = {V = [n], 1?^) is the Gth level 
graph, whose edges are the pairs {x,y} with A^y > a^, for £ £ [L\. 
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One of the main tasks in the algorithm is to find (if it exists) a straight enumeration of 
a graph G which is compatible with a given weak linear order ?/) of y. Roughly speaking, G 
will correspond to a level graph G^^'^ of A (in fact, to a connected component of it), while 'tp 
will correspond to the common refinement of the previous level graphs ..., Hence, 

looking for a straight enumeration of G compatible with ip will correspond to looking for a 
straight enumeration of compatible with previously selected straight enumerations of the 
previous level graphs ..., 

Since the straight enumerations of the level graphs might not be unique, it is important to 
choose, among all the possible straight enumerations, the ones that lead to a common refinement 
(if it exists). 

If G is a connected unit interval graph, its straight enumeration (p is unique up to reversal 
(see Theorem [^. On the other hand, if G is not connected then any possible ordering of 
the connected components induces a straight enumeration, obtained by concatenating straight 
enumerations of its connected components. This freedom in choosing the straight enumerations 
of the components is crucial in order to return all the Robinson orderings of A, and it is taken 
care of in Section 4A using PQ-trees. 

As we will see in Section 4.1.4, the choice of a straight enumeration of G compatible with pj 
reduces to correctly orient straight enumerations of the connected components of G. 

There are three main subroutines in our algorithm: CO-Lex-BFS (see Algorithm 0 , a 
variation of Lex-BFS, which finds and orders the connected components of the level graphs; 
Straight_enumeration (see Algorithm]^, which computes the straight enumeration of a con¬ 
nected graph as in [^; Refine (see Algorithm]^, a variation of partition refinement, which finds 
the common rehnement of two weak linear orders. These subroutines are used in the recursive 
algorithm Robinson (see Algorithm]^. 


4.1.1 Component ordering 

Our first subroutine is CO-Lex-BFS (where CO stands for ‘Component Ordering’) in Algo¬ 
rithm Given a graph G = {V,E) and a weak linear order ip of V, it detects the connected 
components of G and orders them in a compatible way with respect to ip. According to Lemma 
below, this is possible if G admits a straight enumeration compatible with ip. 

Lemma 8. Consider a graph G = {V,E) and a weak linear order ip of V. If G has a straight 
enumeration (p compatible with ip then there exists an ordering Hi,..., 14 of the connected com¬ 
ponents of G which is compatible with ip, i.e., such that Vi I 4 . 

Proof. If 14 ,..., 14 is the ordering of the components of G which is induced by the straight 
enumeration cp, i.e., Vi <(f, ... <(/, 14 , then 14 I 4 as cp is compatible with ip. □ 

Algorithmis based on the following observations. When the vertex p in the set S at line[T0| 
(which represents the current set of unvisited vertices with a tie, known as slice in Lex-BFS) 
has label 0, it means that p is not contained in the current component 14 , so a new component 
containing p is opened. Every time a connected component 14 has been completed, we check if 
it can be ordered along the already detected components in a compatible way with ip. We also 
do this for the last completed component I 4 , at the last iteration z = 0 at line 9 of Algorithm 
Let and Bff^^ denote respectively the first and the last blocks of ip intersecting 14,. We 
distinguish two cases: 

1 . if 14 meets more than one block of ip (i.e., if <,/, we check if all the inner 

blocks between R™™ and are contained in 14. If this is not the case, then the 




algorithm stops. Moreover the algorithm also stops if both V^j and K;_i meet exactly the 
same two blocks, i.e., and In both cases it is indeed not 

possible to order the components in a compatible way with ip. 

2. if 14 ) nieets only one block Bj^ of ip (i.e., I 4 4 and if this block B^ is the first block 
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of the previous connected component K;_i (i.e., then we swap Kj_i and V^} 

in order to make the ordering of the components compatible with ^|J. The ordering a is 
updated by setting, for each v G K;-i its new ordering as cj(u) + |K;| and for each u G 
as a{v) — |VLj-i|. Observe that if we are in the case when both and Kj-i are contained 
in Bh, then we do not need to do this swap, i.e., the two components K; and 14 i_i can be 
ordered arbitrarily. 

The next lemma shows the correctness of Algorithm 
Lemma 9. Let G = {V, E) be a graph and let ijj be a weak linear order of V. 

(i) If Algorithm^successfully terminates then the returned order Vi,... ,Vc of the components 
satisfies 

hi V^. 

(a) If Algorithm^ stops then no ordering of the components exists that is compatible with if. 

Proof, (i) Assume first that Algorithm successfully terminates and returns the linear ordering 
Vi,... ,Vc of the components. Suppose for contradiction that Kj-i K; for some uj G [c] with 
CO > 2. Then there exist x G Kj-i and y G Kj such that y x. Let 2 be the first vertex 
selected in the component K;_i. Then, 2 ; y (for if not the algorithm would have selected y 
before 2 : when opening the component K;-i). Let = {Bi,... ,Bp) and denote by and 

respectively, the first and last blocks of if meeting {BJ^ffl and B^fff are analogously 
defined). Say, x G Bj, y G Bi so that i < j, and 2 ; G B^. As 2 ; y, we have B^ <,/, B^. Suppose 
first that Br <ip Bi. Then, Bi is an inner block between and Bffff[ which is not contained 

in K;_i (since y G Bi), yielding a contradiction since the algorithm would have stopped when 
dealing with the component V^-i. Suppose now that Br = Bi. If has only one block B, 

then B <G Bi = and then the algorithm would have swapped 14; and I4;_i. Hence V'lK;] 

has at least two blocks and <.0 Bi Bj which is again a contradiction since 

the algorithm would have stopped. 

(a) Assume now that the algorithm stops after the completion of the component 14;. Then 
has at least two blocks. Suppose first that the algorithm stops because B^ff. Then 

clearly one cannot have I4;_i 14;. We show that we also cannot have 14 <tp K;-i- For this 

assume for contradiction that I 4 <ip 14-i- Let y be the first selected vertex in I 4 and let x be 
the first vertex selected in I4_i. Then, y G x <pj y (for if not the algorithm would have 

considered the component 14 before I4-i), and thus Bfffff H™™. If then 

the algorithm would have stopped earlier when examining 14 - 1 , since Bff!f[ B^™ 
and 2 I4-i- Hence, we have Bfffff = and, as V’[kL] has at least two blocks, there 
exists a vertex 2 ; G Bff^^ such that x z, which contradicts I 4 <V; K;-i- Suppose now that 
the algorithm stops because <,/, B <pj B™^^ and H ^ I 4 . Let x G Bff^'^,y G B™^ and 

2 ; G H \ 14 , and say 2 ; G 14'. Then we cannot have 14' <'ip 14 since x <pj z, and we also 
cannot have I 4 < 1 /; K;' since 2 : y. Hence the two components I 4 and 14' cannot be ordered 

compatibly with if and this concludes the proof. □ 

4.1.2 Straight enumerations 

Once the connected components of G are ordered, we need to compute a straight enumeration of 
each connected component G[I4]. We do this with the routine StraighLenumeration appplied to 
(G[I4], <7a;), where auj is a suitable given order of 14 (namely, auj = ci'[I4], where a is the vertex 
order returned by CO-Lex-BFS{G,fj)). This routine is essentially the 3-sweep unit interval 
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graph recognition algorithm of Corneil which, briefly, computes three times a Lex-BFS (each 
is named a sweep) and uses the vertex ordering coming from the previous sweep to break ties 
in the search for the next sweep. The only difference of Straight_enumeration{G[Vi^],ai^) with 
respect to Corneil’s algorithm is that we save the first sweep, because we use the order 
returned by CO-Lex-BFS. We now describe the routine Straight-enumeration which is based on 
the algorithms of §3] and §2], Below, deg(j(u) denotes the degree of the vertex v in G. 


Algorithm 2: Straight-enumeration{G, a) 
input: a connected graph G = {V,E) and a linear order a of V 
output: a straight enumeration (p of G, or STOP (G is not a unit interval graph) 


1 = Lex-BFS+{G, a) 

2 cr= Lex-BFS+{G,a 

3 i = 0 

4 L = B = 0 

5 for u = 1,..., |1/| do 


lmn{v) = min{ri : u G 
rmn{v) = max{n : u G A[u]} 
if rmn(v) — lmn{v) ^ deg(j(u) 

I stop 


10 

else 
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if lmn{v) = L and 

12 



Gi = GiU{v}. 

13 


else 

14 



L = lmn{v) 

15 



R = rmn{v) 

16 



i = i + 1 

17 



Gi = {u} 

18 


end 

19 

end 


20 end 



21 return (p = (Ci,..., Gq) 


(index of the blocks of ip) 
(dummy variables to record the current block Bi) 


then 


(leftmost vertex adjacent to v) 
(rightmost vertex adjacent to v) 

{G is not a unit interval graph) 


= R then 


Basically, after the last sweep of Lex-BFS, for each vertex v we define the leftmost vertex 
lmn{v) and the rightmost vertex rmn{v), according to , that are adjacent to v. Checking 
whether rmn{v) — lmn{v) = degg(u) corresponds exactly to checking whether the neighborhood 
condition holds for node v. The vertices with the same leftmost and rightmost vertex are then 
indistinguishable vertices, and they form a block of G. The order of the blocks follows the vertex 
order (T++. 

4.1.3 Refinement of weak linear orders 

Given two weak linear orders ip and (p on V, our second subroutine Refine in Algorithm 
computes their common refinement A </> (if it exists). 

We show the correctness of Algorithmic 

Lemma 10. If Algorithm\^returns a weak linear orders ofV, then is the common refinement 
of Ip and (p. If Algorithm^ returns = 0, then ip and cp are not compatible. 
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Algorithm 3: Refine{il^, (j)) 

input: two weak linear orders ^|J = {Bi, ..., Bp) and (p = (Ci,..., Cq) of V 
output: their common refinement ^ = ip A p, ot ^ 9 {tp and p are not compatible) 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 
17 


i?max jg Qf ^ meeting Ci 

$ = 0 

if there exists a block B of p such that B B ^ Ci then 

I return 0 {p and p are not compatible) 

else 

W = V\Ci 

if Vk = 0 then 

I ^ = i^[Ci]) 

else 

I d> = iP[Ci], Refine{p[W],p[W])) 

end 

end 

if is a weak linear order of V then 
I return $ 
else 

I return 0 {p and p are not compatible) 

end 


Proof. The proof is by induction on the number q of blocks of p. If g = 1 then p = (V) is 
clearly compatible with p and the algorithm returns ^ = p as desired. Assume now q >2. Let 
W = F \ Cl- Then we can apply the induction assumption to p[W] and p\W] (which has q — f 
blocks). 

Assume first that the algorithm returns $ which is a weak linear order of V. We show that 
^ = p A p, i.e., that the following holds for all x,y € V: 


x=^y X y and x y, 

X <# y x y and x <^y with at least one strict inequality. 


(3) 


If X, y G Cl then x =0 y and ^ holds since <h[C'i] = p[Ci]. If x,y € V \ Ci, then ([^ holds by 
the induction assumption. Suppose now x G Ci and y G 1^ \ Ci. Then x y and x <$ y. We 
show that X y holds. For this let Bi (resp., Bj) be the block of p containing x (resp., y). 
Then Bi ^max meets Ci as x G Ci. Moreover, B'^^^ <.0 Bj, which implies x <.0 y. 

Indeed, if one would have Bj ^max^ then we would have = 0 (linein Algorithm]^, since 
Bj ^ Cl as y G Bj \ Ci, and thus would not be a weak linear order of V. 

Assume now that the returned <I> is not a weak linear order of F. If = 0 (line in 
Algorithm]^, then there is a block B ^max g^gj^ that B % Ci, and we can pick elements 
X G B \ Cl and y G Ci H B'^^^ so that y <0 x and x <0 y, which shows that p and p are not 
compatible. If <I> is a weak linear order of a subset U <ZV (line 16 in Algorithm]^, then it means 


that the weak linear order returned by the recursive routine Refine{p\W\,p\W]) is not a weak 
linear order of W (but of a subset) and thus, by the induction assumption, p\W] and p\W] are 
not compatible and thus p and p neither. This concludes the proof. □ 


4.1.4 Main algorithm 

We can now describe our main algorithm Robinson{A, p). Given a nonnegative matrix A G 5” 
and a weak linear order p of V = [n], it either returns a weak linear order $ of F compatible 
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with ^ and with straight enumerations of the level graphs of A, or it indicates that such does 
not exist. The idea behind the algorithm is the following. We use the subroutines CO-Lex-BFS 
and Straight-enumeration to order the components and compute the straight enumerations of 
the level graphs of A, and we refine them using the subroutine Refine. However, instead of 
refining the level graphs one by one on the full set V, we use a recursive algorithm based on 
a divide-and-conquer strategy, which refines smaller and smaller subgraphs of the level graphs 
obtained by restricting to the connected components and thus working independently with the 
corresponding principal submatrices of A. In this way we work with smaller subproblems and 
one may also skip some level graphs (as some principal submatrices of A may have fewer distinct 
nonzero entries). This recursive algorithm is Algorithmbelow. 


Algorithm 4: Robinson{A, tp) 

input: a nonnegative matrix A G 5” and a weak linear order ip of V = [n] 
output: a weak linear order <I> compatible with ip and with straight enumerations of all 
the level graphs of A, or STOP (such an order <I> does not exist) 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 
19 


G is the support of A 

CO-Lex-BFS{G, ip) returns a linear order (lA,..., Vc) of the connected components of G 
compatible with ip (if it exists) and a vertex order a 
4 - = 0 

for w = 1,..., c do 

(puj = Straight-enumeration{G\yj\,a\yj\) (if G[I4;] is a unit int. graph) 

if = Refine{ip\yuP\, (puP) = 0 then 
if = RGfine{ip[VuP\,<p^) = 0 then 

I stop (no straight enumeration compatible with V’[Kj] exists) 

end 

end 

®min smallest nonzero entry of A\yj\ 

A'[I4i] is obtained from A\yj\ by setting entries with value to zero 
if A'[ 14 ;] is diagonal then 

I 4> = (4>,4>^) 

else 

I 4> = Robinson{A'[Vuj],^uj)) 

end 

end 

return: 


The algorithm Robinson{A, ip) works as follows. We are given as input a symmetric nonneg¬ 
ative matrix A G and a weak linear order ip of V = [n]. Let G be the support of A. First, 
we find the connected components of G and we order them in a compatible way with ip. If this 
is not possible, then we stop as there do not exist straight enumerations of the level graphs of 
A compatible with ip (Theorem]^. Otherwise, we initialize the weak linear order 4>, which at 
the end of the algorithm will represent a common refinement of the straight enumerations of 
the level graphs of A. In order to find 4>, we divide the problem over the connected components 
of G. The idea is then to work independently on each connected component 14; and to find 
its (unique up to reversal) straight enumeration and the common refinement of '!/^[I4;] 
and pui- 
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For each component K;, we compute the straight enumeration (j)^^ of if it exists, else 

we stop (Theorem]^. Since (j)oj is unique up to reversal, we check if either cji^j or (j)^ is compatible 
with V’[kL]- Specifically, we first compute the common refinement <1>^ of V’[K;] and (jitj- If h is 
nonempty we continue (Lemma 10), while if it is is empty we compute the common refinement 
‘htj of and If such a common refinement is nonempty we continue (Lemma 10), while 

if it is again empty this time we stop, as no straight enumeration of G^} compatible with 
exists. Finally, we set to zero the smallest nonzero entries of obtaining the new matrix 

(whose nonzero entries take fewer distinct values than the matrix ^[14;]). Now, if the 
matrix is diagonal, then we concatenate after in Otherwise, we make a 

recursive call, where the input of the recursive routine is the matrix and <I>^. If the 

algorithm successfully terminates, then the concatenation ..., (pc) will represent a straight 
enumeration of G, and = ($i,..., <kc) will represent the common refinement of this straight 
enumeration with the given weak linear order ip and with the level graphs of A. 

The final algorithm is Algorithm below. 


Algorithm 5: Robinsonian{A) 
input: a nonnegative matrix A G 5” 

output: a permutation tt such that A^^ is Robinson or stating that A is not Robinsonian 

1 ip = {V) 

2 if Robinson(A,'ip) stops then 

3 I “A is NOT Robinsonian” 

4 else 

5 ^=Robinson{A,^p) 

6 return: a linear order vr of R compatible with 

7 end 


Roughly speaking, every time we make a recursive call, we are basically passing to the next 
level graph of A. Hence, each recursive call can be visualized as the node of a recursion tree, 
whose root is defined by the first recursion in Algorithm and whose leaves (i.e. the pruned 
nodes) are the subproblems whose corresponding submatrices are diagonal. 

The correctness of Algorithm follows directly from the correctness of Algorithm which 
is shown by the next theorem. Indeed, assume that Algorithm is correct. Then, if Algorithm 
terminates then it computes a weak linear order $ compatible with straight enumerations of 
the level graphs of A and thus the returned order tt orders A as a Robinson matrix in view 
of Theorem 0 (n). On the other hand, if Algorithm stops then Algorithm stops with the 
input (A, V’ = (R))- Then no weak linear order d> exists which is compatible with straight 
enumerations of the level graphs of A and thus, in view of Theorem]^ (i), A is not Robinsonian. 

Theorem 11. Consider a weak linear order ip of V = [n] and a nonnegative matrix A G 
ordered compatibly with ip. 

(i) If Algorithm^ terminates, then there exist straight enumerations (p^^\ ..., of the level 
graphs G^^\ ..., G^^'i of A such that the returned weak linear order <I> is compatible with 
each of them and with ip. 

(ii) If Algorithm^stops then there do not exist straight enumerations of the level graphs of A 
that are pairwise compatible and compatible with ip. 


14 








Proof. The proof is by induction on the number L of distinct nonzero entries of the matrix 
A. We first consider the base case L = 1, i.e., when A is (up to scaling) 0/1 valued. We 
first show (i) and assume that the algorithm terminates successfully and returns <1>. Then G 
is the support of A, CO-Lex-BFS{G,if) orders the components of G as Vi V/, and 

$ = (<hi,..., <hc) where each is build as the common refinement of and a 

straight enumeration of G[V/,] (either or Hence G has a straight enumeration (j) and the 
returned <1> is compatible with (f and V’- 

We now show (ii) and assume that Algorithm stops. If it stops when applying CO-Lex- 
BFS{G, fj), then no order of the components of G exists that is compatible with and thus no 
straight enumeration of G exists that is compatible with if (Lemma [^. If the algorithm stops 
when applying Straight-enumeration to then no straight enumeration of G[I4;] exists. 

Else, if the algorithm stops at line in Algorithm^ then is not compatible with neither 

(fuj nor Because G[I4;] is connected, (ftj and (j)^ are its unique straight enumerations (see 
Theorem]^ and therefore no straight enumeration of G[I4,] is compatible with V’lbL]- In both 
cases, no straight enumeration of G exists that is compatible with f;. 


We now assume that Theorem 11 holds for any matrix whose entries take at most L — 1 
distinct nonzero values. We show that the theorem holds when considering A whose nonzero 
entries take L distinct values. We follow the same lines as the above proof for the case L = 1, 
except that we use recursion for some components. First, assume that the algorithm terminates 
and returns <h. Then, $ = (d>i,..., <hc) after ordering the components compatibly with ^|J as 
Vi • • • —b constructing the common refinement <I>^ of V’[^] ^.nd a straight enumeration 

(say) (fco of and having = Robinson{A'[Vij],^i^), where A'[I4,] is obtained from A\VJ\ 

by setting to 0 its entries with smallest nonzero value. By the induction assumption, is 
compatible with straight enumerations of the level graphs of the matrix A'[I4;] and with As 

is compatible with <I>^, which refines both "(/[K;] and 4>uj: h follows that is compatible 
with V’[Kj] and (fui- Therefore, is compatible with straight enumerations of all the level 
graphs of A\yj\ and thus <I> = (d>i,..., $c) is compatible with ip and all level graphs of A, as 
desired. 

Assume now that the algorithm stops. If the algorithm stops at CO-Lex-BFS{G,ip), then 
no linear order of the connected components of G exists that is compatible with ip and then 
no straight enumeration of G exists that is compatible with ip (Lemma , giving the desired 
conclusion. If the algorithm stops at line then a connected component 14; is found for 
which '(/’[Kj] is not compatible with any straight enumeration of G[V/,], giving again the desired 
conclusion. 

Assume now that the algorithm stops at line|16t i.e., there is a component V/, for which the 
algorithm terminates when applying Robinson{A'[V^j], $ 0 ;). Then, by the induction assumption, 
we know that: 


no straight enumerations of the level graphs of A'[14;] exist 
that are pariwise compatible and compatible with <I>tj, 


where $ 0 ; is the common refinement of V'IKj] and a straight enumeration (say) (p^) of G[I4;]. 
Assume, for the sake of contradiction, that there exist straight enumerations ..., of 
the level graphs G^^^ = G,..., G^^^ of A, that are pairwise compatible and compatible with ip. In 
particular, is a straight enumeration of G[I4;] compatible with V'lK;]- If ¥^*'^^[ 14 ;] = (pui, 

then the restrictions {i > 2) yield straight enumerations of the level graphs of A'[I4;] that 
are pairwise compatible and compatible with and V'iKj]; and thus with their refinement = 
'f’[yu}]F(pui, contradicting (Q. Hence, so that ip\yj\ is compatible with both (pi^ and 

its reversal <p^^. This implies that V’[K;] = (K;)- But then the reversals [14;]; • ■ •,^*'^^14;] 
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provide straight enumerations of the level graphs of that are pairwise compatible and 

compatible with = 4>io = This contradicts again Q and concludes the proof. □ 


4.2 Complexity analysis 


We now study the complexity of our main algorithm. First we discuss the complexity of the two 
subroutines CO-Lex-BFS and Refine in Algorithms 1 and 2 and then we derive the complexity 
of the final Algorithm 4. In the rest of the section, we let m denote the number of nonzero 
(upper diagonal) entries of A, so that m is the number of edges of the support graph G = 
and m = \Ei\ > \E 2 \ > ... > \El\ for the level graphs of A. We assume that A is a nonnegative 
symmetric matrix, which is given as an adjacency list of an undirected weighted graph, where 
each vertex x G C is linked to the list of vertex/weight pairs corresponding to the neighbors y 
of X in G with nonzero entry Axy. 

A simple but important observation that we will repeatedly use is that, given a weak linear 
order of V, we can assume the vertices V to be ordered according to a linear order r of V 
compatible with Then, the blocks of V’ are intervals of the order r and thus one can check 
whether a given set C C 1/ is contained in a block i? of V’ in 0{\G\) operations (simply by 
comparing each element of G to the end points of the interval B). Furthermore, the size of any 
block of il) is simply given by the difference between its extremities (plus one). 


Lemma 12. Algorithm^runs in 0(|I^| + |Fi|) time. 

Proof. It is well known that Lex-BFS can be implemented in linear time 0(|y| + |i?|). In our 
implementation of Algorithm we will follow the linear time implementation of Corned [^, 
which uses the data structure based on the paradigm of “partitioning” presented in 19 . Recall 


that the blocks of if are intervals in r, which is a linear order compatible with if. In order to 
carry out the other operations about the components of G we maintain a doubly linked list, 
where each node of the list represents a connected component V/, of G and it has a pointer to 
the connected component V^j-i ordered immediately before V/, and to the connected component 
V/;+i ordered immediately after V/,. Then, swapping two connected components can be done 
simply by swapping the left and right pointers of the corresponding connected components in 
the doubly linked list. Furthermore, each node in this list contains the set of vertices in V/,, the 
first block and the last block in if meeting 14.. These two blocks Bfif™ and 

can be found in time 0(|I4|) as follows. First one finds the smallest element Umin (resp. the 
largest element Umax) of 14 in the order r, which can be done in 0(|I4|). Then, Bffi™ is the 
block of if containing Uminj which can be found in 0(|I4|)- Analogously for which is the 

block of if containing Umax- Checking whether 14 is contained in the block B^fi\ can be done 
in 0(|I4|) (since is an interval). In order to check whether all the inner blocks between 

R™™ and are contained in 14 we proceed as follows. Let B^^ be the union of these inner 

blocks, which is an interval of r. First we compute the sets 14 C and 14 C which can 

be done in 0(|I4|)- Then we need to check whether B^^ C I 4 or, equivalently, whether the two 
sets 14 \ (R™™ U and B^ are equal. For this we check first whether I 4 \ (R™™ U is 

contained in B^ (in time 0(|I4|)) and then whether these two sets have the same cardinality, 
which can be done in 0(|I4|)- Hence, the complexity of this task is 0(^^|I4|) = 0(|H|). 
Therefore we can conclude that the overall complexity of Algorithmis 0(|H| + |F1|). □ 

Lemma 13. Algorithm^ runs in 0(|H|) time. 


Proof. We show the lemma using induction on the number q of blocks of (p. Recall that the 
blocks of if are intervals in r, which is a linear order compatible with if. If q = 1 the result is 
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clear since the algorithm returns $ = 0 without any work. Assume q > 2. The first task is to 
compute the last block of V’ meeting Ci. For this, as in the proof of the previous lemma, 

one finds the largest element Umax of Ci in the order r and one returns the block of ip containing 
Umax, which Can be done in 0(|Ci|). Then let B be the union of the blocks preceding B^^^. In 
order to check whether B <Z Ci or, equivalently, whether Ci \ B'^^^ = B, we proceed as in the 
previous lemma: we first check whether Ci \ B'^^^ C B and then whether \Ci \ B'^^\ = \B\, 
which can be done in 0(|Ci|). Hence, the running time is Ode'll) for this task which, together 
with the running time 0(|H \ Cij) for the recursive application of Refine to the restrictions of 
ijj and (p to the set H \ Oi, gives an overall running time 0 (|H|). □ 

We can now complete the complexity analysis of our algorithm. 

Theorem 14. Let A be a nonnegative n x n symmetric matrix given as a weighted adjacency 
list and let m be the number of (upper diagonal) nonzero entries of A. Algorithm^ recognizes 
whether A is a Robinsonian matrix in time 0{d{n + m)), where d is the depth of the recursion 
tree created by Algorithm\^ Moreover, d < L, where L is the number of distinct nonzero entries 
of A. 

Proof. We show the result using induction on the depth d of the recursion tree. In Algorithm 
we are given a matrix A and its support graph G, and we set ^p = {V = [n]). First we run the 
routine CO-Lex-BFS{G,fi) in 0(n + m) time, in order to find and order the components of G. 
For each component W,, the following tasks are performed. We compute a straight enumeration 
<puj of G[I 4 ,], in time 0{nu) + mui) where = |I 4 ,| and is the number of edges of 
The reversal cp^ can be computed in 0(|ldj|) by simply reversing the ordered partition 0^^, 
which is stored in a double linked list. Hence, we apply the routine Refine to 'tp[\uj] and (pi^ 
(or (pi^), which can be done in 0{nu}) time. Then we build the new matrix and checks 

whether it is diagonal, in time Finally, by the induction assumption, the recursion step 

Robinson{A'[Vi^], <I>tj) is carried out in time 0{di^{ni^ + where d^j denotes the depth of the 

corresponding recursion tree. As d^j < d — 1 for each w, after summing up, we find that the 
overall complexity is 0{d{n + m)). The last claim: d < L is clear since the number of distinct 
nonzero entries of the current matrix decreases by at least 1 at each recursion node. □ 

4.3 Finding all Robinsonian orderings 

In general, there might exist several permutations reordering a given matrix A as a Robinson 
matrix. We show here how to return all Robinson orderings of a given matrix A, using the 
PQ-tree data structure of [^. 

A PQ-tree T is a special rooted ordered tree. The leaves are in one-to-one correspondence 
with the elements of the groundset V and their order gives a linear order of V. The nodes of 
T can be of two types, depending on how their children can be ordered. Namely, for a P-node 
(represented by a circle), its children may be arbitrary reordered; for a Q-node (represented by 
a rectangle), only the order of its children may be reversed. Moreover, every node has at least 
two children. Given a node a oiT,Ta denotes the subtree of T with root a. 

A straight enumeration ip = {Bi, ..., Bp) of a graph G = {V, E) corresponds in a unique way 
to a PQ-tree T as follows. If G is connected, then the root of T is a Q-node, denoted 7 , and it 
has children fii,..., f3p (in that order). For i G [p], the node fii is a P-node corresponding to the 
block Bi and its children are the elements of the set Bi , which are the leaves of the subtree Tg.. 
If a block Bi is a singleton then no node fii appears and the element of R is directly a child of 
the root 7 (see the example in Figure [^. 
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Figure 1: 


A connected graph G and the PQ-tree corresponding to its straight enumeration 


If G is not connected, let Pi,..., 14 be its connected components. For each connected 
component G[V4], 74 is its PQ-tree (with root 7 ^) as indicated above. Then, the full PQ-tree T 
is obtained by inserting a P-node a as ancestor, whose children are the subtrees Ti ,(see 
Figure]^. 



Figure 2: The PQ-tree corresponding to the straight enumeration of a disconnected graph 


We now indicate how to modify Algorithms and in order to return a PQ-tree T encoding 
all the permutations ordering A as a Robinson matrix. 

We modify Algorithm by taking as input, beside the matrix A and the weak linear order 
ijj, also a node a. Then, the output is a PQ-tree 74 rooted in a, representing all the possible 
weak linear orders compatible with ?/) and with straight enumerations of all the level graphs 
of A. It works as follows. 

Let G be the support of A. The idea is to recursively build a tree 74 for each connected 
component V4 of G and then to merge these trees according to the order of the components found 
by the routine CO-Lex-BFS{G,'ip). To carry out this merging step we classify the components 
into the following three groups: 

1. 0, which consists of all oj G [c] for which the connected component V4 meets at least two 
blocks of ijj. 

2. A, which consists of all a; G [c] for which the component V4 is contained in some block Bi, 
which contains no other component. 

3. n = where flj consists of all u G [c] for which the component V4 is contained in 

the block Bi, which contains at least two components. 

Every time we analyze a new connected component ui G [c] in Algorithm we create a Q-node 
7 [j. After the common refinement <I>tj (of V'[V4] and the straight enumeration (f)^^ of G[V4] or its 
reversal) has been computed, we have two possibilities. If A'[V4] is diagonal, then we build the 
tree 74 rooted in 7^^, and whose children are P-nodes corresponding to the blocks of <I>^ (and 
prune the recursion tree at this node). Otherwise, we build the tree 74 recursively as output of 
Robinson(A'[V 4 ], <h^,7t^). 

After all the connected components have been analyzed, we insert the trees 74 in the final tree 
74 in the order they appear according to the routine CO-Lex-BFS{G,'tp). The root node is a 
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and is given as input. For each component K;, we do the following operation to insert 7L in 7^, 
depending on the type of the component Kj: 

1. If a; G 0, then (or is the only straight enumeration compatible with Then 

we delete the node 7 ^; and the children of become children of a (in the same order). 

2. If a; G A, then both and its reversal are compatible with V'lKj]- Then 7 ^^ becomes 
a child of a. 

3. If cj G 0,1 for some i G [p], then both (j)^ and (j)^ are compatible with ip[Vco] and the same 
holds for any oj' G flj. Moreover, arbitrary permuting any two connected components 
14), 14)' with co,u}' G will lead to a compatible straight enumeration. Then we insert a 
new node /?* which is a P-node and becomes a child of a and, for each oj' £ Oi, 7 ^/ becomes 
a child of /?*. 
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Algorithm 6: Robinson{A,il),a) 

input: a nonnegative matrix A G iS"", a weak linear order i/j of V = [n] and a node a 
output: A PQ-tree 7^ representing all the possible weak linear orders compatible with 
■i/’ and with straight enumerations of all the level graphs of A, or STOP (such a 
tree does not exist) 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 


G is the support of A 

CO-Lex-BFS{G, V’) returns a linear order (Vi,..., 14) of the connected components of G 

compatible with (if it exists) and a vertex order a 

group the connected components (c.c.) K; (w G [c]) of G as follows: 

0 : all oj for which 14 meets at least two blocks of V' 

A : all a; for which 14 is contained in a block Bi containing no other connected component 
for i G [p], all a; for which Vui F Bi and Bi contains at least two connected components 
$ = 0 

for a; = 1 ,..., c do 

create a Q-node 7 ^^ 

(puj = StraighLenumeration{G[Vu;],a[Vuj]) (if G[14] is a unit int. graph) 

if = Refine{ip[Vui], poj) = 0 theu 
if $ 0 ; = Refine{ip[Vuj], = 0 theu 

I stop (no straight enumeration compatible with i/’[ 14 ] exists) 

end 

end 

®min the smallest nonzero entry of A [ 14 ] 

A'[ 14 ] is obtained from A[14] by setting entries with value to zero 
if A'[ 14 ] is diagonal then 

create a PQ-tree 74 rooted in and whose children are P-nodes corresponding to 
the blocks of 
else 

I 74 = Robinson{A'[Vuj],^u;,7oj) 

end 

end 

74 is the PQ-tree rooted in a, build as follows: 

00 = 1 

while 00 < c do 
if (U G 0 then 

I the children of 7 ^^ become children of a and remove 7 ^;; w = a; -|- 1 
else 

if (U G A then 

I set 74 as child of a (if a = 0, then set a = 7 ^^); w = a; -|- 1 
else 

let Qi s.t. 00 G create a P-node (3i and set it as child of a (if a = 0, then set 
a = Pi) 

foreach 00 ' G do 

I set 7 tj/ as children of Pi 

end 

uj = io 117 j I 

end 

end 

end 

return: 74 
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Finally, we modify Algorithm by just giving the node a = 0 (i.e. undefined) as input 
to the first recursive call. The overall complexity of the algorithm after the above mentioned 
modifications is the same as for Algorithm Indeed, determining the type of the connected 
components can be done in linear time, by just using the information about the initial and final 
blocks and already provided in Algorithm Furthermore, the operations on the 

PQ-tree are basic operations that do not increase the overall complexity of the algorithm. 


Algorithm 7: Robinsonian (A) 
input: a nonnegative matrix A G 5” 

output: a PQ-tree T that encodes all the permutations tt such that is a Robinson 
matrix or stating that A is not Robinsonian 


xi, = (y) 

2 0 = 0 

3 G is the support of A 

4 T=Robinson{A,’ip,a) 

5 if the number of leaves of T is equal to n then 

6 I return: T 

7 else 

8 I “A is NOT Robinsonian” 

9 end 


5 Conclusions 


We introduced a new combinatorial algorithm to recognize Robinsonian matrices, based on a 
divide-and-conquer strategy and on a new characterization of Robinsonian matrices in terms 
of straight enumerations of unit interval graphs. The algorithm is simple, rather intuitive and 
relies only on basic routines like Lex-BFS and partition refinement, and it is well suited for 
sparse matrices. 

The complexity depends on the depth d of the recursion tree. An obvious bound on d is the 
number L of distinct entries in the matrix. A first natural question is to find other better bounds 
on the depth d. Is d in the order 0(n), where n is the size of the matrix? The answer is no: 
some computational experiments carried out in |36[ show that, for some instances, the depth of 
the recursion tree is d = L > n. This suggests that more sophisticated modifications might be 
needed to improve the complexity of the algorithm. A possible way to bound the depth is to find 
criteria to prune recursion nodes. One possibility would be, when a submatrix is found for which 
the current weak linear order consists only of singletons, to check whether the corresponding 
permuted matrix is Robinson. Another possible way to improve the complexity might be to 
compute the straight enumeration of the first level graph and then update it dynamically (in 
constant time, using a appropriate data structure) without having to compute every time the 
whole straight enumeration of the next level graphs; this would need to extend the dynamic 
approach of (^, which considers the case of single edge deletions, to the deletion of sets of 
edges. Other possible future work includes investigating how the algorithm could be used to 
design heuristics or approximation algorithm in the noisy case, when A is not Robinsonian, for 


example by using (linear) certifying algorithms as in 20 to detect the edges and the nodes of 


the level graphs which create obstructions to being a unit interval graph. 
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A Example 


We give a complete example of the algorithm. We consider the same matrix A as the one used 
in the example in Section 5 of 30 . However, since |30[ handles Robinsonian dissimilarities, we 
first transform it into a similarity matrix and thus we use instead the matrix Omax*/ — A, where 
Omax denotes the largest entry in the matrix A. If we rename such a new matrix as A, it looks 
as follows: 


1 

2 

3 

4 

5 

6 
7 


1 2 3 4 5 6 7 

/II 290505 
11 2 0 9 0 8 

11 0 5 0 5 

11 0 3 0 

11 0 8 
11 0 
11 


9 

A = 10 
11 
12 

13 

14 

15 

16 

17 

18 

19 V 


8 

5 

5 

5 

0 

7 

0 

7 

11 


9 10 
2 0 

10 0 

2 0 
0 3 

9 0 

0 10 
8 0 
6 0 
11 0 

11 


11 12 
5 0 

5 0 

5 0 

0 3 

7 0 

0 6 
7 0 

10 0 

6 0 
0 6 
11 0 

11 


13 14 

5 6 

5 2 

5 10 

0 0 
7 5 

0 0 

7 5 

8 7 

5 2 

0 0 

9 7 

0 0 
11 7 

11 


15 16 17 18 19 

0 0 2 0 5 \ 

0 0 10 0 8 

0 0 2 0 5 

10 3 0 9 0 

0 0 9 0 10 

5 8 0 5 0 

0 0 8 0 9 

0 0 6 0 7 

0 0 10 0 8 

4 9 0 5 0 

0 0 6 0 7 

9 6 0 10 0 

0 0 5 0 7 

0 0 2 0 5 

11 4 0 10 0 

11 0 4 0 

11 0 8 

11 0 

11 / 


Here the red labels denote the original numbering of the elements. Throughout we will use the 
fact that adding any multiple of the all-ones matrix J to the matrix A does not change the 
Robinson(ian) property. The recursion tree computed by Algorithmis shown in Figure]^ at 
page The weak linear order at each node represents the weak linear order ijj given as input 
to the recursion node, while the number on the edge between two nodes denotes the minimum 
value in the current matrix A, which is set to zero before making a new recursion call (in this 
way, the reader may reconstruct the input given at each recursion node). 


Root node 

We set i/) = iy) and invoke Algorithm]^ Then, Algorithmwould find two connected compo¬ 
nents: 


Hi = {1,2,3,5,7,8,9,11,13,14,17,19}, 

Hs = {4,6,10,12,15,16,18}. 

Hence, we can split the problem into two subproblems, where we deal with each connected 
component independently. 

1.0 Connected component Vi, level 0 

The submatrix A[Vi] induced by Vi is shown below (after shifting the matrix by amin = 2, i.e., 
substracting 2J from it). 
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1 

2 

3 

5 

7 

8 
9 
11 

13 

14 
17 
19 


1 

/9 


2 

0 

9 


3 5 7 8 9 

7 3 3 3 0 

0 7 6 3 8 

9 3 3 3 0 

9 6 5 7 

9 5 6 
9 4 
9 


11 

3 

3 

3 
5 
5 
8 

4 
9 


13 14 17 19 

3 4 0 3 \ 

3 0 8 6 

3 8 0 3 

5 3 7 8 

5 3 6 7 

6 5 4 5 

3 0 8 6 

7 5 4 5 

9 5 3 5 

9 0 3 

9 6 

9 / 


If we invoke Algorithm on the support G[Vi], we get the following straight enumeration: 


( 4 ) 


^ = ({1,3,14}, {5, 7,8,11,13,19}, {2, 9,17}). 


Note that since "(/^[Vi] has only one block, we do not need to compute the partition refinement 
in Algorithm and then the common refinement is simply 4* = (/>. The smallest nonzero value 
of the matrix in Q is = 3. Hence, we compute A'[17i] by setting to zero the entries of the 
matrix in Q with value equal to Since A'[Vi] is not diagonal, we make a recursion call, 

and we set ^ = 4>. To simplify notation, we shall rename A'[Vi] as A\y\\ after every iteration. 

1.1 Connected component Vi, level 1 

The input matrix A[Vi] is obtained by setting to zero the entries of the matrix in Q with value 
at most 3. The support of this matrix is still connected, and its straight enumeration is: 


</-= ({1,3}, {14}, {13}, {8,11}, {5, 7,19}, {9,17}, {2}). 

If we invoke Algorithm it is easy to see that the common refinement ^ oi ijj and (j) is exactly 
(j). The smallest nonzero value of A\yi] is now = 4. Hence, we compute A'[l/i], which is 
not diagonal and thus we make a recursion call, setting ?/; = $ and renaming A'[1 /l] as A[l/i]. 

1.2 Connected component Vi, level 2 

The input matrix A[Vi] is obtained by setting to zero the entries of the matrix in Q with value 
at most 4. The support of this matrix is still connected, and its straight enumeration is: 

= ({1}, {3}, {14}, {13, 8,11}, {5, 7,19}, {9,17, 2}). 

The common refinement with ^ is then given by: 

ch = ({1}, {3}, {14}, {13}, {8,11}, {5, 7,19}, {9,17}, {2}). 

The smallest nonzero value of A[Vi] is now = 5. Hence, we compute A'[Vi], which is not 
diagonal and thus we make a recursion call, setting ijj = ^ and renaming A'[17i] as A[Vi]. 

1.3 Connected component Vi, level 3 

The input matrix A[Vi] is obtained by setting to zero the entries of the matrix in Q with value 
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at most 5. The support of this matrix is not connected, and thus Algorithm will detect the 
following connected components: 

Fii = {1,3,14}, 

1^12 = {13,8,11}, 

Fi3 = {5,7, 19,9,17,2}. 

We analyze each connected component independently. 

1.3.1 Connected component Vii, level 4 

The common refinement is ‘h = ({1}, {3}, {14}) and = 7. We make a recursion call, finding 
two connected components {1} and {3,14}, and then we stop, because the first one has only 
one vertex, while the submatrix corresponding to the second one is diagonal (after shifting). 

1.3.2 Connected component V 12 , level 4 

Since amin = 6, we hrst “shift” the input submatrix. Then, the common refinement is = 
({13}, {11}, {8}) and = 7 (i.e. =1 after shifting). We make a recursion call, finding two 
connected components {13} and {11,8}, and then we stop, because the first one has only one 
vertex, while the submatrix corresponding to the second one is diagonal (after shifting). 

1.3.3 Connected component V 13 , level 4 

Since amin = 6, we first “shift” the input submatrix. Then, the common refinement is 4> = 
({7}, {19}, {5}, {9,17}, {2}) and = 7 (i.e. =1 after shifting). We update and 

we make a recursive call because it is not diagonal. The new input matrix is then given by 
the submatrix in Q restricted to V 13 by setting to zero the entries with value at most 7. The 
support of this matrix is not connected, and thus Algorithmwill detect the following connected 
components: 

^131 = {7}, 

Vh32 = {19,5}, 

^133 = {9,17, 2}. 

We then split the problem over the connected components. The first one has only one vertex 
while the second and the third one are diagonal (after shifting). This was the last recursion 
node open. 

Therefore, we get that the final common refinement of the level graphs of the matrix in Q is: 
= ({1},{3},{14},{13},{11},{8},{7},{19},{5},{9,17}, {2}) 

and the PQ-tree 7} computed by the algorithm is reported in Figure]^ at page 

2.0 Connected component V 25 level 0 

The submatrix A[V 2 ] induced by V 2 is reported below (after shifting the matrix by amin = 3). 


A[F2] 


4 6 
4/8 0 
6 
10 
12 

15 

16 
18 


10 12 15 16 18 

0 0 7 0 6 \ 

7 3 2 5 2 

8 3 16 2 

8 6 3 7 

8 1 7 

8 1 

8 / 


( 5 ) 


V 
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If we invoke Algorithm on the support G[V 2 ], we get the following straight enumeration: 

0= ({4}, {15,18}, {6,10,12,16}). 


Note that since V’[^] has only one block, we do not have to compute the partition refinement 
in Algorithmic and then the common refinement is simply ^ = (j). The smallest nonzero value 
of the matrix in (jC is = 1. Hence, we compute A'[V 2 ] by setting to zero the entries of 
the matrix in (jC with value equal to Since A'[V 2 ] is not diagonal, we make a recursion 

call, and we set ?/) = <I>. To simplify notation, we shall again rename A'[V 2 ] as A[V 2 ] after every 
iteration. 

2.1 Connected component V 2 , level 1 

The input matrix A[V 2 ] is obtained by setting to zero the entries of the matrix in (jC with value 
at most 1. The support of this matrix is still connected, and its straight enumeration is: 

^ = ({4},{15},{18},{6,12},{10}, {16}). 

If we invoke Algorithm [C it is easy to see that the common refinement is ^ = 4> The smallest 
nonzero value of A[V 2 ] is now = 2. Hence, we compute A'[V 2 ]) which is not diagonal and 
thus we make a recursion call, setting ^ = 4> and renaming A'[V 2 ] as A[V 2 ] for the next iteration. 

2.2 Connected component V 2 , level 2 

The input matrix A[V 2 ] is obtained by setting to zero the entries of the matrix in (jC with value 
at most 2. The support of this matrix is still connected, and its straight enumeration is: 

<^ = ({4,15}, {18}, {12}, {6,10,16}). 

The common refinement is then simply: 

4> = ({4}, {15}, {18}, {12}, {6}, {10}, {16}). 

The smallest nonzero value of A[V 2 ] is now = 3. Hence, we compute A'[V 2 ], which is not 
diagonal and thus we make a recursion call, setting -ip = ^ and renaming A^[V 2 ] as A[l^] for the 
next iteration. 

2.3 Connected component V 2 , level 3 

The input matrix A[V 2 ] is obtained by setting to zero the entries of the matrix in (j^ with value 
at most 3. The support of this matrix is still connected, and its straight enumeration is: 

cP = ({4}, {15}, {18}, {12}, {6}, {10}, {16}) 

which is equal to the given ip (and thus will be the common refinement 4>). The smallest nonzero 
value of A[V 2 ] is now = 5. Hence, we compute ^'[ 1 / 2 ], which is not diagonal and thus we 
make a recursion call, setting ip = ^ and renaming A'[V 2 ] as A[V 2 ] for the next iteration. 

2.4 Connected component V 2 , level 4 

The input matrix A[V 2 ] is obtained by setting to zero the entries of the matrix in (j^ with value 
at most 5. The support of this matrix is not connected, and thus Algorithm will detect two 
connected components. 


H 21 = {4,15,18,12}, 
H22 = {6,10,16}. 
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We then split the problem over the connected components. 

2.4.1 Connected component V 215 level 5 

The common refinement is = ({4}, {15}, {18}, {12}) and = 6. We then make a recursion 
call and we find again a connected graph. Again the common refinement does not change (in 
fact the blocks are singletons) and now = 7. Finally, the submatrix A'[V 2 i] is diagonal, so 
we prune the node. 

2.4.2 Connected component V 22 , level 5 

The common refinement is = ({6}, {10}, {16}) (again the blocks are singletons) and = 6. 
But now A'[V 22 ] is diagonal, so we prune the node. This was the last recursion node open. 

Therefore, we get that the hnal common refinement of level graphs of A[V 2 ] is: 


4>2 = ({4},{15},{18},{12}, {6}, {10}, {16}) 


and the PQ-tree Ti computed by the algorithm is shown in Figure at page|31[ Finally, we can 
build the PQ-tree representing the permutation reordering A as a Robinson matrix. Since both 
Vi and V 2 are contained in the same block of "0 (which at the beginning is ip = ([u])), then we 
create a P-node (named a since it is the ancestor) whose children are the subtrees 71 and Ti. 
The final PQ-tree is shown in Figure]^ at page 31, and is equivalent to the one returned by [30] . 

Note that, using the fact that a common refinement which is a linear order cannot be refined 
anymore, the depth could be lowered to d = 4 (since the right branch would have depth d = 3). 
Understanding this and other possible speed up will be the subject of future work. 
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Figure 4: The PQ-tree corresponding to the common refinement of level graphs of A[Vi] 
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Figure 5: The PQ-tree corresponding to the common refinement of level graphs of A[V 2 ] 



Figure 6; The PQ-tree corresponding to the permutations reordering A as a Robinson matrix 
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